Incremental Cosine Computations for Search and Exploration of Tag Spaces
نویسندگان
چکیده
Tags are often used to describe user-generated content on the Web. However, the available Web applications are not incrementally dealing with new tag information, which negatively influences their scalability. Since the cosine similarity between tags represented as co-occurrence vectors is an important aspect of these frameworks, we propose two approaches for an incremental computation of cosine similarities. The first approach recalculates the cosine similarity for new tag pairs and existing tag pairs of which the co-occurrences has changed. The second approach computes the cosine similarity between two tags by reusing, if available, the previous cosine similarity between these tags. Both approaches compute the same cosine values that would have been obtained when a complete recalculation of the cosine similarities is performed. The performed experiments show that our proposed approaches are between 1.2 and 23 times faster than a complete recalculation, depending on the number of co-occurrence changes and new tags.
منابع مشابه
Improving Search and Exploration in Tag Spaces Using Automated Tag Clustering
In recent years we have experienced an increase in the usage of tags to describe resources. However, the free nature of tagging presents some challenges regarding the search and exploration of tag spaces. In order to deal with these challenges we propose the Semantic Tag Clustering Search (STCS) framework. The framework first groups syntactic variations using several measures based on the Leven...
متن کاملA semantic-based approach for searching and browsing tag spaces
In this thesis we propose the Semantic Tag Clustering Search framework (STCS). This framework consists of three parts. The first part deals with syntactic variations by clustering tags that are syntactic variations of each other and assigning a label to them. The second part of the framework addresses the problem of recognizing homonyms and identifying semantically related tags. The last, and f...
متن کاملScaling Pair-Wise Similarity-Based Algorithms in Tagging Spaces
Users of Web tag spaces, e.g., Flickr, find it difficult to get adequate search results due to syntactic and semantic tag variations. In most approaches that address this problem, the cosine similarity between tags plays a major role. However, the use of this similarity introduces a scalability problem as the number of similarities that need to be computed grows quadratically with the number of...
متن کاملShear Waves Through Non Planar Interface Between Anisotropic Inhomogeneous and Visco-Elastic Half-Spaces
A problem of reflection and transmission of a plane shear wave incident at a corrugated interface between transversely isotropic inhomogeneous and visco-elastic half-spaces is investigated. Applying appropriate boundary conditions and using Rayleigh’s method of approximation expressions for reflection and transmission coefficients are obtained for the first and second order approximation of the...
متن کاملA Cluster-Based Approach for Search and Exploration of Tag Spaces
Although Semantic Web technology is increasingly becoming more and more important, tagging remains a popular method to describe Web resources. Therefore it is important to address the issues that are found in current tagging search engines, such as Flickr. We find that the free nature of tagging results in many issues for tag search engines, such as synonyms, homonyms, syntactic variations, etc...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012